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ABSTRACT 

Computer techniques have been developed for mapping water quality 
parameters from LANDSAT data, using surface samples collected in 
an ongoing survey of water quality in Saginav/ Bay (Lake Huron), 
Michigan, sponsored by the US Environmental Protection Agency. 
Chemical and biological parameters were measured on 31 July 1975 
at 16 bay stations in concert with the LANDSAT overflight. Application 
of stepwise linear regression to seven of these parameters and corres- 
ponding LANDSAT measurements resulted in regression correlation 
coefficients that varied from 0. 94 for temperature to 0. 71 for Secchi 
depth. Chloride, conductivity, total Kjeldahl nitrogen, total phosphorus , 
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and chlorophyll a were best correlated with the ratio of LAN.DSAT Band 
4 to Band 5* Temperature and Secchi depth were best correlated to 
Band 5. Results of the regression analysis were used to map the water 
quality parameters over the entire bay, 

INTRODUCTION 

In response to environmental requirements for large -area surveillance 
of water quality and watershed land use, NASA's LANDSAT-2 investi- 
gation is establishing the cost benefits of LANDSAT to the surveillance 
and control of lake eutrophication in the Great Lakes Basin. To 
accomplish this objective, LANDSAT data products are being gener- 
ated to support the United States Environmental Protection Agency 
(EPA) modeling study of eutrophication in Saginaw Bay; the State of 
Michigan's survey of inland lakes and watersheds for the purpose of 
assessing the effects of watershed land use on lake water quality; and 
the State of Wisconsin's lake survey to determine eutrophication status, 
causes, effects, and control treatments. One goal of this work is to 
determine which water quality parameters best correlate with LAND- 
SAT measurements. This paper reports on results directed toward 
this goal using water quality parameters and LANDSAT data acquired 
on 31 July 1975 in the Saginaw Bay test area. 

Several institutions and federal agencies in the United States and 
Canada are conducting a comprehensive survey of water quality and 
circulation in Lakes Huron and Superior (the Upper Lakes Reference 
Study, a part of the United States /Canadian Great Lakes Water Quality 
Agreement of 1972). In Saginaw Bay (Lake Huron), EPA is sponsoring 
a 36-month modeling study of water quality. EPA's program will 
develop a deterministic model that will describe water quality changes 
within the bay and their relationships to enrichment and pollution 
caused by man. The resulting model will be used to evaluate various 
strategies to control nutrient flow into the bay. Important goals in this 
project are to describe, on a seasonal basis, the circulation and water 
quality in Saginaw Bay, to monitor inputs of nutrients from its water- 
shed, and, ultimately, to develop and evaluate models for predicting 
water quality in the bay as a function of various control strategies. 

A number of investigators have reported on the feasibility of deter- 
mining various water quality parameters from LANDSAT data. Klemas 
(Ref 1)* has used Secchi depth and suspended sediment measurements 
as correlated with Band 5 image radiance to map turbidity and circula- 
tion patterns in Delaware Bay. Yarger (Ref 2) has processed multidate 
digital data for reservoirs in Kansas to study the effects of sun angle 
change, and has studied the 5/4, 6/4, and 7/4 band ratios to predict 
suspended solids (ppm) at unsampled areas from ground truth samples. 
Johnson (Ref 3) has applied the equations for predicting suspended 
sediment derived from stepwise regression analysis of ground truth 
data from Delaware Bay to image data for Chesapeake Bay and has 
found reasonable agreement with ground truth measurements for 
Chesapeake Bay. 

^’■References are located at the end of this paper. 
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This LANDSAT-2 investigation has produced two previous reports (Refs 
4 and 5). The first discusses the applications of a supervised com- 
puter processing technique to a 3 June 1974 L.ANDSAT scene (1680- 
15455) to produce a geometrically-corrected color-coded image of 
Saginaw Bay which shows nine discrete categories of turbidity, as 
indicated by nine Secchi depths between 0, 3 and 3. 3 meters. This first 
effort was limited to the consideration of the Secchi depth parameter as 
an indicator of turbidity and the application of the supervised process- 
ing technique to correlate the Secchi depth parameter to the LANDSAT 
measurements* More recently (Ref 5), this investigation reprocessed 
the same June 1974 scene to develop relationships between 12 of the 
water quality parameters and the LANDSAT measurements by a step- 
wise linear regression program (Ref 6). This first application of the 
regression technique only evaluated the four LANDSAT bands as the 
independent variables and concluded that Band 6 alone was sufficient 
and the most important band for predicting values of most of the water 
quality parameters. This paper extends this analysis by using the ten 
independent variables; the LANDSAT bands individually (Bands 4, 5, 6, 
and 7); and the six non-redundant ratios (Bands 4/5, 4/6, 4/7, 5/6, 

5/7, and 6/7) in the regression program to process the 3 June 1974 
scene and a 31 July 1975 scene (2190-15404). This analysis shows that 
a ratio of bands is more strongly correlated to most water quality 
parameters. 

Mapping specific ranges of a water quality parameter by a supervised 
technique requires subdividing the parameter of interest into discrete 
categories, locating and editing LANDSAT measurements correspond- 
ing to each category, and applying the training measurements to cate- 
gorize other LANDSAT picture elements ("pixels”). This technique is 
well established and is by far the most efficient procedure for mapping 
land-cover categories, i.e. , urban, grassland, bare soil, water, etc., 
in which the spectral characteristics of the categories are very different 
(uncorrelated). The application of this same processing technique to 
mapping the amount or concentration of a continuous water quality para- 
meter, i.e., Secchi depth, chlorophyll concentration, etc., may not be 
justified as better estimates of the parameter may be obtained with less 
effort by a simpler technique. If a. continuous equation can be estab- 
lished between the parameter and LANDSAT, e. g. , the equation of a 
straight line with one independent and one dependent variable, its 
solution would provide many more estimates of the desired parameter 
than would be practical by the supervised technique, which requires a 
training set for each discrete value of the parameter. This paper 
investigates this possibility by applying a stepwise linear regression 
program to seven water quality parameters and LANDSAT measure- 
ments observed on 31 July 1975 at l6 stations in Saginaw Bay (Table 1). 

TEST area 

Saginaw Bay is a shallow extension of Lake Huron and is bounded by 
five counties of southeastern Michigan (Figure 1). The bay has an area 
of some 2, 960 km^ and a maximum length and width of 82 km and 42 km, 
respectively. The mean depths are 4. 6 m for the inner bay and 14. 6 m 
for the outer bay. The Saginaw River enters the bay at its extreme 
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Figure 1. Map of Saginaw Bay, Showing the 6l Water 

Quality Stations by the Symbol O . The Cruise 
Tracks of the Two Vessels and the 16 Stations 
Sampled on the Day of the LANDSAT Overpass 
(31 July 1975) are also Shown. 

southwestern end and contributes approximately 90% of the pollutants 
found in the bay (Ref 7). This river and its tributaries drain a water- 
shed of more than 16, 060 km^ and contain four major cities and much 
agricultural land. Consequently> inx^uts of salts, nutrients, and pol- 
lutants to the bay have been increasing for many years. Levels of 
turbidity and algal production are consistently high, especially within 
the inner bay. Major declines in commercial fish yields, wildfowl 
populations, and esthetic values have resulted from this eutrophication. 
The natural estuarine -like movement of pollutants from the bay into 
southern Lake Huron may also reduce water quality throughout the 
lower Great Lakes. While circulation within the bay is highly wind- 
dependent, the pattern is generally counterclockwise. Clear Lake 
Huron water enters mainly along the western shore; turbid bay water 
exists along the eastern shore. Significant but unknown quantities of 
sediment are re-suspended regularly by wave action. The lower two- 
thirds of Saginaw Bay usually freezes over during January and February. 
These and other characteristics of Saginaw Bay have been documented 
by Freedman (Ref 7). 


5 




WATER SAMPLE PROGE AM 


The EPA measurement program in Saginaw Bay is creating a data bank 
of water quality information that is being used to develop and test models 
of circulation, nutrient loadings, and algal productivity. Since April of 
1974, surface and subsurface measurements have been obtained at the 
6l bay stations shown in Figure 1, at 18-day intervals coinciding with 
LANDSAT overflights. Normally, three consecutive days are required 
to sample all stations. For this study, only measurements from sta- 
tions sampled the day of the LANDSAT overpass were processed. 

These l6 stations are shown in Figure 1 linked by solid lines indicating 
the tracks of the two sample boats. The sampling began about three 
hours before and ended about eight hours after the satellite overpass. 

On-site measurements at each bay station include temperature, pH, 
dissolved oxygen, conductivity, alkalinity, and water clarity. Clarity 
is indicated by Secchi depth and percent transmittance measurements. 
Variables measured in the laboratory include soluble nutrients (nitrate - 
rCtrlte, orthophosphate, sulfate, silicate, and ammonia), organic 
mat 3 rials (nitrogen, phosphorus, carbon, and chlorophylls), chloride 
and metals (sodium, potassium, calcium, magnesium, and six trace 
metals), and total suspended solids. Enumerations of phytoplankton and 
zooplankton are also made. Coordinated studies of current patterns, 
nutrient inputs, and bottom fauna are also underway by EPA. 

COMPUTER PROCESSING OF LANDSAT DATA 

The LANDSAT computer ^compatible tapes (CCTs) for this investigation 
were processed on the Bendix Multispectral Data Analysis System 
(MDAS) (Ref 4). Four major processing steps were involved; (1) trans- 
forming the locations of the bay stations from navigation charts to 
LANDSAT CCT coordinates, (2) extracting the LANDSAT digital meas- 
urements from the CCTs for each of the l6 bay station areas, (3) 
applying stepwise regression to the water quality parameters and 
LANDSAT measurements derived from each bay station area, and (4) 
producing a ratio image of the regression results on the film recorder. 

Ear t h-to-L-ANDSAT coordinate transformation 

Three basic steps were involved in the automatic referencing of ground 
coordinates to LANDSAT coordinates. The first step consisted of 
automatically retrieving the latitude and longitude of carefully selected 
ground control points (GCPs) from a map through a digitizing process. 
The criteria for selecting these GCPs is that they can be easily and 
accurately identified on LANDSAT imagery. The second step consisted 
of converting the latitude and longitude of these GCPs to LANDSAT co- 
ordinates by using a theoretical transformation derived from known and 
assumed spacecraft parameters, including heading, scan rate, altitude, 
and a knowledge of earth rotation parameters. The LANDSAT coordi- 
nates and transformation matrices thus obtained are approximate, based 
on the use of the nominal spacecraft parameters. The approximately- 
derived LANDSAT coordinates and transformation are used, howeve”, 
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to identify the actual LANDSAT coordinates associated with the GCPs. 

To accomplish this, the coordinates of a GCP are input to the Bendix 
MDAS. The approximate transformation computes the BANDSAT co- 
ordinates and displays the area on the TV monitor. Positional errors 
of the GCPs displayed to the operator are designated by a cursor to the 
computer, which uses the error measurement to derive an improved 
set of coefficients for the transformation matrix. This procedure is 
repeated on additional GCPs until the desired geometric accuracy is 
achieved. This investigation used 20 GCPs within the LANDSAT scene, 
The resulting bay station coordinates were transformed to LANDSAT 
coordinates with an error of less than one picture element (pixel). A 
LANDSAT pixel corresponds to an area of 57 by 79 m (0. 44 hectares). 

LANDSAT measurements from station areas 

The MDAS TV monitor was used to display the single pixel best corres- 
ponding to each bay station location. A cursor was then positioned, 
expanded, and shaped by the operator about each station site to desig- 
nate a station area of 60 to 100 pixels in size. Once the station areas 
were designated, the MDAS computer extracted the measurements from 
all pixels defined by the cursor and calculated the mean digital count in 
each band (Table 1). For the table shown, the digital counts from the 
standard LANDSAT CCT have been multiplied by two in Bands 4, 5, and 
6 and by four in Band 7, The six non-redundant ratios of the four 
LANDSAT bands were calculated from the station area means. The 
mean values of the digital counts in each LANDSAT band and the ratios 
of the means for each bay station were then stored in a disk file for use 
in the regression analysis. 

Stepwise regression analysis 

The LANDSAT measurements stored on the disk file were used in a 
stepwise linear regression program (Ref 6) to investigate relationships 
between the LANDSAT measurements and each of seven water quality 
parameters. The stepwise regression procedure first determined 
which single independent variable (one of the four LANDSAT bands or the 
six ratios) provided the best statistical correlation with the dependent 
variable (one of the water quality parameters). In successive steps, 
a second independent variable was added, if necessary, to improve the 
multiple correlation. 

Filming water quality parameters 

MDAS was used to produce a color- coded image of the ratio of Band 4 to 
Band 5, which v/as determined by the regression procedure to be the 
single best variable for mapping five of the seven water quality para- 
meters. The ratio image was viewed with 24 color levels on the MDAS 
TV display. These levels were grouped to seven final levels. The 
ratio image, produced on an Optronics film i-ecorder, was used with 
the regression equations to mterpret the various water quality para- 
meters over the bay. 
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S TATIS TICAIi RESULTS 

The results of applying the regression program to the 31 July 1975 
scene are summarized in Table 2. The results for only the first step 
of the regression are reported since, in every case, the correlation 
coefficient was not significantly improved by adding other independent 
variables. The constants, coefficients, and most significant band or 
ratio of bands listed in the table may be used to predict the dependent 
variable. For example; 

Chloride (mg/l) = 99.27 - 57. 138 (Band 4/Band 5) 

The regression correlation coefficient noted in the table provides a 
measure of the fit of the regression equation to the data and has a 
maximum value of unity. The standard error of estimate has the same 
units as the dependent variable and is the statistical standard deviation. 
Approximately 68% of the measurements are expected to be within one 
standard deviation of the mean. 

Table 2. Regression Results for Saginaw Bay on 31 July 1975 
(16 Stations) 


Dependent 
Variable fUnits) 

Equation 

Constant 

Equation 

Coefficient 

independent 

Variable 

(LANDSAT'Band or 
Btand Ratio) 

Regression 

Correlation 

Coefficient 

(r) 

Standard 
Error of 
Estimate 

Temperature (°G) 

9.688 

0.5304 

Band 5 

0.944 •* 

0.6560 

Secchi Depth (m) 

9.707 

- 0.3010 

Band 5 

0.711 * 

0.9650 

Chloride (mg/l) 

99.27 

• 57.138 

Band 4 to Band 5 

0.922 »* 

1.891 

Conductivity (micromhos) 

650.0 

^ 264.82 

Band 4 to Band 5 

0.920 ** 

8.876 

Total Kjeldahl 
Nitrogen (mg/l) 

4,455 

- 2.664 

Band 4 to Band 5 

0.934 •* 

0.0799 

Total Phosphorus (mg/l) 

0.1846 

- 0.1112 

Band 4 to Band 5 

0.914 

0,0039 

Clitorophyl) a (ug/l) 

303.1 

- 189.4 

Band 4 to Band 5 

0.869 

8.455 


*■ Significant at P< 0.01 (r = 0,606) 

** Significant at P< 0.001 (r = 0.725) 

Figure 2 shows the correlation between one of the seven water quality 
parameters (dependent variable) and the LANDSAT Band 4 to Band 5 
ratio (independent variable). The figure shows the 1 6 sample points, 
the regression line and equation, the standard estimate of error (dashed 
lines), and the regression correlation coefficient (Table 2). 

Figure 3 provides a visual evaluation of the overall fit of the regression 
equation for one of the seven water quality parameters. The regression 
equation and the appropriate independent variable values (Band 4 to 
Band 5 ratios) shown in Table 1, were used to arrive at predicted values 
of each water quality parameter, These predicted values, were, in 
turn, plotted against the measured values. If the regression equation 
resulted in a perfect correlation, all points would fall on a straight line 
having unity slope. The standard error of estimate is shown in Figure 
3 as the dashed lines. 
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DISCUSSION 

As reported previously (Ref 5), when the regression program was 
applied to the 3 June 1974 scene with the four LANDSAT bands as the 
independent variables, Band 6 alone was chosen by the regression 
procedure as providing the best correlation with most water quality 
parameters. Both Yarger (Ref 2) and Johnson (Ref 3) also found this 
to be true in Kansas reservoirs and in Chesapeake Bay, repectively. 

It was suggested (Ref 5) that the selection of Band 6, in the stepwise 
regression analysis, may have been caused by the noticeable atmos- 
pheric haze which causes non-uniform measurements of scene 
radiance and degrades the correlations in the two visible bands, Bands 
4 and 5. When the four LANDSAT bands and the ratio of bands were 
applied as independent variables to the 3 June 1974 scene, a ratio of 
bands was chosen by the regression program as providing the best 
correlation with most water quality parameters. The fact that a ratio 
is a best choice for mapping water quality parameters was again veri- 
fied by using bands and the ratios of bands in regression processing 
the 31 July 1975 scene, the results of which are observed in Table 2 
and the figures of this paper. Table 2 shows that the ratio of Band 4 
to Band 5 is sufficient to map the concentrations and distribution of 
chloride, conductivity, total Kjeldahl nitrogen, total phosphorus, and 
chlorophyll a with a correlation coefficient of 0. 869 or more. Tem- 
perature and Secchi depth preferred Band 5 alone. 

It should be emphasized that LANDSAT measures color or volume 
reflectance of the water and does not, for example, measure tempera- 
ture, chloride, conductivity, etc. directly. Only a few water quality 
parameters; chlorophylls, algal populations, and particulate carbon; 
have a direct influence on the color or volume reflectivity of Saginaw 
Bay water. However, the other water quality parameters do correlate 
secondarily with color or volume reflectance to the extent that they all 
characterize the same water masses. 

Figure 4 shows an image of the ratio of Band 4 to Band 5. The concen- 
tration and distribution of five water quality parameters can be inter- 
preted directly from this image. The ratio values are indicated next to 
the appropriate breaks in the gray scale. The corresponding values for 
each of the water quality parameters have been calculated from either 
the equations shown in Table 2 or from plots of each variable, such as 
those shown in Figure 2. It should be emphasized that these values are 
accurate only within the ranges of standard error indicated in Table 2. 

Analysis of the ratio image has confirmed some known features of cir- 
culation and water quality in Saginaw Bay. Previous surveys of the bay 
(Ref 7) have indicated that the predominant flow of Saginaw River water 
is northward along the eastern shore of the bay. Less turbid Lake Huron 
water dominates the outer bay and enters the inner bay chiefly along the 
the western shore. Zones of mixing and local circulation are apparent 
on the map, as are shoal areas where sediment evidently has been re- 
suspended. 
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Saginaw Bay on 31 July 1975 
Part of LANDSAT Scene 2190-15404 



Figure 4. Ratio Image and Predicted Water Quality of 
Saginaw Bay (Seven Gray Levels) 
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A number of prominent water mass patterns can be identified from 
Figure 4. A lobe -shaped body of relatively clean Lake Huron water is 
located near the north edge of the ratio image. More turbid water 
enters the bay from the Saginaw River and flows counterclockwise along 
the shore. Also present are a few seemingly isolated areas of clean 
water within the bay. For example, note the area northwest of the most 
prominent point of land along the eastern shore. 

The ratio image has been keyed to five of the 30 water quality parameters. 
The concentration and distribution of other water quality parameters 
may be obtained by their correlation with the five parameters mapped 
here. For example, sodium is highly correlated with conductivity and, 
if the relationship between sodium and conductivity is used, the con- 
ductivity map (ratio image) can also be transformed into a map showing 
the concentration of sodium over the bay. Similarly, the distribution of 
four major metals can be interpreted by their correlation with the 
chloride map. 

The significance to water quality of each parameter measured in Saginaw 
Bay varies with the location and season. In general terms, however, 
the following applies to the July 1975 data. Temperature (also chemical 
and biological) gradients in the bay during the spring reflect the mixing 
of eutrophic Saginaw River water with oligotophic (and cooler) Lake 
Huron water. Thus, temperature maybe used coincidentally to dis- 
criminate between waters of markedly different quality. Secchi depth 
estimates are used to approximate water transparency as affected by 
suspended particles and solutes. Chlorophyll a is also an approximate 
indicator of living algal biomasses. Conductivity, which varies directly 
with the concentration of dissolved ions, is generally high in eutrophic 
or polluted waters. Similarly, chloride is used here as a conservative 
tracer of enriched Saginaw River water. 

CONCLUSIONS 

LANDSAT digital data and ground truth measurements for Saginaw Bay 
(Lake Huron), Michigan, for 31 July 1975 can be correlated by stepwise 
linear regression and the resulting equations used to estimate "invisible" 
water quality parameters in nonsampled areas. The correlation of 
these parameters with one another indicates that the transport of 
Saginaw River water can now be traced by a number of water quality 
parameters, one or more of which are directly detected by LANDSAT. 
Chloride, conductivity, total Kjeldahl nitrogen, total phosphorus, and 
chlorophyll a are best correlated with the ratio of LANDSAT Band 4 to 
Band 5. Temperature and Secchi depth correlate best with Band 5. 

Water quality parameters mapped from the linear regression equations 
will indicate which water quality parameter(s) is most reliable as a 
tracer to identify Saginaw River water as it circulates throughout the 
bay and is diluted by Lake Huron water. The resulting regression 
equations can be used to map the concentration and distribution of water 
quality parameters throughout the bay, given the appropriate LANDSAT 
measurements. These parameters need not be directly detectable by 



LANDSAT, provided their concentration is correlated with some water 
characteristic that is detectable. The predicted values for each water 
quality parameter can be displayed on a TV monitor and color -coded 
and mapped onto film. Thus, LANDSAT monitoring, as an adjunct to 
conventional point- sampling, provides an economical basis for extra- 
polating water quality parameters from point samples to unsampled 
areas and provides a synoptic view of water mass boundaries that no 
amount of point sampling could provide. 
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